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Abstract 

Many recent approximation algorithms for different variants of the traveling salesman prob¬ 
lem (asymmetric TSP, graph TSP, s-t-path TSP) exploit the well-known fact that a solution of 
the natural linear programming relaxation can be written as convex combination of spanning 
trees. The main argument then is that randomly sampling a tree from such a distribution and 
then completing the tree to a tour at minimum cost yields a better approximation guarantee 
than simply taking a minimum cost spanning tree (as in Christofides’ algorithm). 

We argue that an additional step can help: reassembling the spanning trees before sam¬ 
pling. Exchanging two edges in a pair of spanning trees can improve their properties under 
certain conditions. 

We demonstrate the usefulness for the metric s-t-path TSP by devising a deterministic 
polynomial-time algorithm that improves on Sebo’s previously best approximation ratio of |. 

keywords: traveling salesman problem, s-t-path TSP, approximation algorithm, spanning 
tree 


1 Introduction 


The traveling salesman problem (TSP) is probably the b est-known iVP-hard combinatorial opti¬ 
mization problem. Although for the general metric TSP ChristofidesI ’ 19761 algorithm with its 


approximation ratio | is still unb eaten, we ha ve seen progress for several variants and special 


cases in particular since 2010. See IVveenl |2012| for a detailed survey. 

Many of the recent approximation algorithms begin by solving the natural linear programming 
relaxation, which was first proposed bv lDantzig, Fulkerson and JohnsonI |l954l | . It was observed by 
Held and Karp 1970l ] that a solution x* to this LP can (after scaling down by a factor except 
for the s-t-path case) be written as convex combination (or, equivalently, probability distribution) 

of spanning trees. Of cou rse, t his distribution is far from unique . _ 

Asadpour et al.l |20inl | and lOveis Gharan, Saberi and SinghI |2nil| improved the approxima¬ 


tion ratio for the asymmetric TSP and graph TSP, respectively, by randomly sampling a spanning 
tree from a m aximum entropy distribution descr ibing and then completing it to a tour in an 

optimal way. lAn, Kleinberg and Shmovsl |2015l | considered the metric s-t-path TSP and showed 
that a spanning tree randomly chosen from any distribution describing x* is —in expectation— 
good enough to improve on Christohdes’ algorithm for this problem. 
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In this paper, we propose to modify the distribution before sampling. By exchanging two edges 
in a pair of trees with certain properties we obtain two new trees and hence a new distribution. 
We call this step reassembling trees. Under certain conditions the two new trees have better 
properties than the old ones. For the s-f-path TSP we show that this step can indeed improve 
the approximation ratio. 


1.1 The s-t-path TSP 


Let us dehne the (metric) s-f-path TSP formally. As in the classical version of the TSP, we want 
to visit a set of cities at minimum total cost. However, rather than returning to the origin at the 
end, we are given the origin and the destination as input. More precisely, we are given a finite 
set V, two elements s and t of V, and a symmetric distance function c :V x U —>■ M>o satisfying 
the triangle inequality. Throughout this paper we will denote by n the number of elements of V. 
We ask for a sequence V = {ui,... ,Vn} with ui = s and Vn = t, such that YllZi c(uj,Ui+i) is 
minimized. 

The classical metric TSP is the special case when s = t. We note that in both variants we can 
either require that every city is visited exactly once, or, equivalently, we can drop the requirement 
that c satishes the triangle inequality but allow visiting cities more than once. 

The s-t-path TSP (for s ^ t) is clearly no easier than the classical metric TSP (we can reduce 
the latter to the former by guessing two cities that are adjacent in an optimum tour); in particular 
there is no polynomial-time approxima tion algorithm with better ratio than ^ unless P = NP 
( Karpinski. Lampis and SchmiedI 2013 |). 

We assume s 7 ^ t henceforth. It is sufficient to compute a connected multi-graph with vertex 
set V in which exacty s and t have odd degree; such a graph will be called an {s,t}-tour. Here 
is why: given an {s, f}-tour, we can hnd an Eulerian walk from s to t (using every edge exactly 
once) in linear time and shortcut whenever a vertex is visited not for the hrst time; this yields an 
s-f-path with vertex set V, which —due to the triangle inequality— is no more expensive than 
the {s, t}-tour. 


1.2 Previous approximation algorithms 

Christohdes ’ |l97d | algorithm, originally designed for the classical metric TSP, works also for 


the s-t-path TSP. It first computes a minimum cost spanning tree (V, S) in the complete graph 
spanned by V, and then adds a minimum cost Ts-join J, where T 5 is the set of vertices whose 
de gree has the wrong parity (even for s or t, odd for other vertices). The result is an {s,t}-tour. 
HoogeveenI 199ll | showed that Christohdes’ algorithm has approximation ratio t for the s-t- 


pat h TSP, and that in fact this ratio is asymptotically attained by an inhnite set of examples. 


An. Kleinberg and Shmovsl 2015l | proposed the hest-of-many Christofides algorithm for the s- 


t-path TSP and proved that it has approximation ratio ~ 1.619. The algorithm is quite 

simple: it computes an optimum solution to the natural LP relaxation (see ([T]) below) and writes 
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it as convex combination of spanning trees. For each of these spanning trees, (V^, S'), it computes 
a minimum weight Tg-join J, where Ts is the set of vertices whose degree has the wrong parity, 
obtains an {s, t}-tour S (j J, and finally outputs the best of these. We will go into details 
in Subsection 11.41 Thi s was the fi rst improvement over Christofides’ algorithm that applied to 
general metrics. Then Sebd 2013 | improved the analysis, obtaining the approximation ratio |. 

We will describe how in Subsection 11.81 

For the special case where c is the metric closure of an unw e ighted graph, bett er approximation _ 

algori thms have been obtained bv iMomke and SvenssonI |201ll |. lMuchal |2014l |. and Kleinbe rg and Shmoys 


2015 1 (in this order). The best known approximation ratio 1.5, o btained firs t bv ISebd and Vygen 


20141 ]. matches the integrality ratio of the LP in this special case. iGad |2013l | gave a simpler proof 


of this result. 

Some of the above-mentioned papers apply also to a generalization (to T-tours, for general 
T); see Section [6] for a brief discussion. 


1.3 Notation 

For a given instance {V,s,t,c) let n := |14|, and let E := (^) be the edge set of the complete 
graph on V. For C/ C 14, 6(17} denotes the set of edges with exactly one endpoint in U, and E[U] 
the set of edges with both endpoints in U. We write 6{v) := 5({u}) for v & V. For TCI/ with 
|T| even, a T-join is a set J C E for which \6(v) fl J| is odd for all u E T and |(5(u) n J| is even for 
all u E 1/ \ T. A T-cut is a cut 6(U) for which |f 7 n Tj is odd. T he intersection of a T-join and a 
T-cut always contains an odd number of edges. EdmondsI 1965 ] proved that a minimum weight 
T-join can be computed in polynomial time. 

For a vector x E and E C. E we write x{E) := XleeC and c{x) := Yle={v w}&E w)xe', 
moreover, £ { 0 , 1 }^ denotes the incidence vector of F (i.e., Xe — ^ e E T and Xe ~ ^ 
for e E T \ T), and c(F) := c(x^) = Yle={v c(v,w). By S we denote the set of edge sets of 
spanning trees in (V, E). For S ^ S, the set T5 := {u E E \ {s, t} : | 5 (u) G S\ odd} U {u E {s, t} : 
|5(u) n S\ even} contains the vertices whose degree in S has the wrong parity. 


1.4 Best-of-many Christofides 


O ur algorithm will be an extension o f the best-of-many Christofides algorithm that was proposed 
bv lAn, Kleinberg and Shmovsl 2015| . Therefore, we hrst describe this algorithm in more detail. 
The algorithm begins by solving the LP relaxation 


min c(x) 
subject to 
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Obviously, the integral solutions to m are precisely the incidence vectors of the edge sets of 
the Hamiltonian g-f-pat hs in (V,E). So t his L P is indeed a relaxation. 

Following an idea of Held and Karnl |l970l |. An. Kleinberg and Shmovs 20151 observed that 
the polytope defined by ([H is {x E M^q : x{E) = n—1, x{E[U]) < \U\ — 1\/$ U C V, x{E\U]) < 
\U\ — 2 V{s,t} C U C H} and hence is contained in the spanning tree polytope of {V, E) (Edmond^ 
197[)| i. Therefore, an optimum solution x* (in fact every feasible solution) can be written as 


X* = where p is a distribution on S, i.e., > 0 for all 5 E 5 and = 1- 

By Caratheodory’s theorem we can assume that ps > 0 for less than spanning trees {V, S). 
An optimum LP solution x*, such spanning trees, and such n umbers pg can be computed in 
polyn omial time, as can be shown with the ellipsoid method (IGro tsch eL Lovasz and Schriiver 
19811) or using the splitting-off technique (cf. Genova and Williamsonrj2015l |L 

We will fix X* henceforth. We will also fix the distribution p for the rest of the introduction 
but will modify it later. 

For each S' E 5 with ps > 0, the best-of-many Christofides algorithm then computes a 
minimum weight T^-join J, and considers the {s,f}-tour S U J; the output is the best of these. 
Note that trying all S E S with ps > 0 leads to a deterministic polynomial-time algorithm which 
is at least as good as randomly picking S E S with probability ps- 


1.5 Basic Analysis 


We follow the basic analysis of lAn. Kleinberg and Shmovsl 20151 1 . The cost of the {s,t}-tour 
that the best-of-many Christofides algorithm computes for a given instance depends on its choice 
of p only: it is BOMC(p) := min 5 'g 5 :pg>o (c(S') -|- min{c(J) : J is a Tg-join}). This is at most 
J2s&sPs (ciS) -f min{c(J ) : J is a Ts-ioinl). Note tha t YlsesPSc{S) = c{x*). 

A well-known result of lEdmonds and Johnsoru |l973l | says that the minimum weight of a r 5 -join 
is the minimum c(y) over all y in the T 5 -join polyhedron 


{y G ^>0 ■ y{C) > 1V r^-cuts c]. 


( 2 ) 


Therefore 

BOMC(p) < c{x*)+ '^psc{y^) (3) 

SeS 

for any set of vectors {y^)s&S such that y^ is in the T 5 -join polyhedron ([2]). The difficulty in the 
analysis li es in finding an appropriate set of vectors {y^)s&S- Let us call them correction vectors 
(although lAn. Kleinberg and Shmovsl |2ni5l | used this term with a different meaning), because 
they bound the cost of parity correction. 


1.6 Narrow cuts 

Let C := {S{U) : 0 7 ^ C H, x*{5{U)) < 2}. The elements of C are called narrow cuts. By 
m they are {s,t}-cuts. If there are no narrow cuts except 5{s) and 6{t), then ^ is a valid 
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correction vector for all S £ S, leading to BOMC(p) < |c(x*), which is IWolsevl ’s 1980l | analysis 
of Christofides’ algorithm for the classical metric TSP. But in the s-t-path TSP we will in general 
have narrow cuts. 

The following property is useful: 

Lemma 1 ( An. Kleinberg and Shmovsl (2015 |1 The narrow cuts form a chain: there are sets 
{s} = Lq C Li C • • • C Li_i C Li = V \ {t} so that C = {h{Li) : i = 0, ..., i}. 


Proof: We have x*((5(s)) = x*{6{t)) = 1 and hence (5(s) G C and 5{V \ {t}) = 5{t) £ C. Suppose 
we have U' dV and U" C V with h(U'), S(U") £C and s £U'n U" i {[/', [/"}. Then t^U'd U" 
and 2 + 2 > x*{5{U')) + x*{5{U'')) > x*{5{U'\U'')) + x*{5{U''\U')) >2 + 2, a contradiction. □ 

We remark: 


Proposition 2 Given x*, the set C of narrow cuts can be computed in polynomial time. 


Proof: Lemma [T] implies that there is an order V = {vi,V 2 , ■ ■ ■ ,Vn} such that each narrow cut 
has the form (5({ui,...,+}) and is the only {uj, Ui+i}-cut C with x*{C) < 2. Thus it suffices to 
compute a {u,tc}-cut C with minimum x*{C) for all v,w £V with v ^ w. This can be done by 
( 2 ) applications of any polynomial-time max-flow algorithm (or more efficiently by computing a 
Gomory-Hu tree). □ 

Similarly to Lemma [1] we have: 


Lemma 3 For all C,C' £ C with C C we have 

x*{Cr\C') < ^x*{C) + ^x*{C') -1. 


Proof: Let C = 5{Li) and C = S{Lj) with 0 < i < j < £. Then x*{C) + x*{C') 
x*{5{L,\Li))>2. 


Moreover, An. Kleinberg and Shmo^ |2015| and Sebd |2013| observed that 


2x*{CGC') = 
□ 


pLn := PS < x*iC)-l (4) 

5e5:|SnC|even 

Pone ■■= ^ PS > 2-X*iC) (5) 

se-S:|5nc|=i 

for every narrow cut C £ C. Note that Q and ([5]) follow directly from x*{C) = H C\ 

for a\\ C d E and [S' n C*! > 1 for all C G C and S £ S. From this we also get 


P 


C 

many 


Y.PS 

s&s 


5nC| -1 
2 


- [x*{C) - 1 -pLn) 


( 6 ) 
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Figure 1: A spanning tree {V, S). Filled circles show vertices in T 5 . The edge set S is partitioned 
into its s-t-path Is (turquoise) and its T^-join Js (red). The narrow cuts are those shown in grey 
(dotted and solid). Those that have an even number of edges of S are shown in solid grey; each 
of them contains (at least) one red and (at least) one blue edge. 


for all C G C. 

For S G 5 let Is denote the edge set of th e s-t-path in (1/, S). Let Js '■= S\ Is and note that 


Js is a r 5 -join. See Figured] for an example. Sebd 2013| observed that 


X] X] < '^PSX^^ (7) 

cec se5:|Snc|=i s&s 


because Is (in fact every s-t-path) intersects every narrow cut (in fact every {s,t}-cut), and if 
S n 6{Li) = 5 n 5{Lj) = {e} for some 0 < i < j < i and e G i?, then S n S{Lj \ Li) = 0, 
contradicting that [V, S) is connected. 


1.7 Correction vectors 

Consider 

/ ;= Px* + {1 - 2P)x-^s + ( 8 ) 

for S G S, where 0 < /3 < and G M>q is a nonnegative vector satisfying 

z^iC) > /3i2-x*iC)) (9) 

for all C G C with [S' H C*! even. 

Lemma 4 ( An. Kleinberg and Shmovsl (2015 |1 For every S G S and every Ts-cut C we have 

/(C) > 1. 


Proof: Let S G S and C he a Ts-cut. Js is a Tg-join, so | Js n Cl > 1. 

If C ^ C, then x*{C) > 2 and hence y^{C) > f3x*{C) + (1 — 2/3)| Js n C| > 2/3 -|- 1 — 2/3 = 1. 
If C G C, then |Js H C| (the intersection of an s-t-path and an {s,t}-cut) is odd and | Js H C| 
(the intersection of a Ts-join and a Ts-cut) is odd, so jS" H C| is even. Hence y^{C) = j5x*{C) + 
(1 — 2/3)1 Js n C| -|- z^{C) > /3x*(C) + 1 — 2fj + z^{C) > 1 due to ([9]). □ 
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So is in the Tg-join polyhedron ([2]) for all 5 G 5. For any distribution p and nonnegative 
vectors {z^)seS with ([9]), we get with ([3]) and jH]) the bound 

BOMC(p) < {1 + /3)c{x*) + {I -2j3)'^psc{Js) + '^psc{z^)- (10) 

ses ses 


Now the question is how to choose the vectors z^. 


1.8 The analyses of An, Kleinberg, Shmoys, and Sebd 

An. Kleinberg and Shmo^ 2015l |. Sebd )2013 |. and then also Gad |2015 |. chose 

z^ = ^ max {O, 2/3 — a —/3x*(C')} r;*", 

CeCilSTlCleven 


( 11 ) 


where G M>q are vectors with v^{C) > 1 for all C G C. 

As /g n (7 7 ^ 0 (recall that Is is the edge set of an s-t-path), this choice implies z^{C) > 
a + (2/3 — a — f3x*{C)) = 2/3 — fjx*{C) for all C G C with 15(7(71 even, as required in ([9]). 

Writing Ip := and Jp := Y^s&sPsX"^^ we get, with (fTO]) and ([IT]) and c{Ip) + 

c{Jp) = c{x*): 


BOMC(p) < 


< 


(1 + /3)c{x*) + (1 - 2l3)c{Jp) + ^ psc{z^) 

ses 

(1 + I3)c{x*) + (1 - 2P)c{Jp) + ac{Ip) 

+ ^ max |0, 2/3 — a —/3x*((7)} c(u*^) 

SeS CeC:|SnC|even 

(1 + a + /3)c(x*) + (1 - a - 2/3)c{Jp) 

+ l^CTenmax{0, 2(3 - a - I3x* {C)} c{v^) 
cec 

(1 + a + (3)c{x*) + (1 - a - 2(3)c{Jp) 

+ ^^(x*((7) — 1) max {0, 2(3 — a — /3x*((7)} c(u^), 
c&c 


where we used (|4|) in the last inequality. The three papers choose the vectors {C ^C) differently. 

< 

V / Z—rfL-'tC- - 

X 


An. Kleinberg and Shmoys |2015 | showed that {C G C) can be chosen so that H 


. Observing (x — 1) (2/3 — a — j3x) < for all x G M and setting /3 = -^ and a = 1 — 

they obtained BOMC(p) < c{x *). 

Snc for (7 G C. Then indeed {C) > 1 for 


Sebd |2013| chose ■.= T.s&S:\SnC\=iPsx' 


all (7 G C due to dSj). Using d?]) and observing ^ < a + (3 — 2^/^ for all x < 2 one 
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gets BOMC(p) < (1 + a + I3)c{x*) + (1 — a — 20)c(Jp ) + {a + (3 — 2y/al3)c{Ip). Setting /? = | and 


a = ^, this yields BOMC(p) < |c(x*). ( Sebd 201df set /? = | and a = ^, which yields the same 
bo und i f c( Jp) > |c(x*); otherwise he used {S G 5) as correction vectors). 

Gaol 2015| simply chose to be the incidence vector of a cheapest edge in C, which is clearly 


best possible in this framework, but he could not obtain a better approximation ratio. 


1.9 New approach 

We use the ideas of An. Kleinberg and Shmovs |2015 | and Sebd 20131 but define differently; 
see Section[2l Like lSebdl 2013 |. we will bound c{z^) by a constant fraction of c{Ip). More precisely, 


we will find vectors z^ G K>g with ([9]) and Psc{z^) < (1 — 2/3)c{Ip). With (fTOl) this will 

immediately yield BOMC(p) < (2 — j5)c{x*). The question is how large we can choose /?. 

In the Appendix we give an example th at sho w s that it will in general not be possible to 
choose /? > I and hence directly improve on SebdI ’s 20131 approximation ratio of |. Therefore 
we will modify p first. By reassembling the trees that contribute to the convex combination 
X* = '^sesPsX^ we eliminate the most critical configurations. We will show how in Section [3l 
Then we can complete the calculation in Section |5] and obtain an improved approximation ratio. 


2 New correction vectors 


Since we want to bound the weighted sum of the costs of the vectors z‘^ by a multiple of c{Ip) 
(the weighted sum of the costs of the s-t-paths Is, S' G 5), each pair (S, e) with S G S and e £ Is 
will make a c ontrib ution to some of the vectors z^ (where S' £ S can be S or a different tree). 


As in SebdI ’s 2013l | analysis, there are two types of contributions. A pair (S, e) will contribute 
7s,e(l — 2/3) to z^ and a total of (1 — xs,e){^ ~ 2/3) to vectors z^ for other trees S'. The latter 
contribution is distributed as follows: if C G C is the narrow cut with S H C = {e} (there c an be 
only one such cut), then e will contribute to z^ for all S' £ S with |S' n Cl even. ISebdI ’s 2013l | 
analysis is essentially equivalent to choosing ^s,e = \ for all S G S and e £ Is- We will obtain an 
improvement by choosing individual values. 


2.1 The new vectors 

For any S £ S and C £ C choose an edge Cq £ Is Ci C. Moreover, for any C G C let ec G C be a 
minimum cost edge in C. For any pair S £ S and e £ Is we will choose a number 0 < 'js^e < 1 
later. Then, for S £ S, we set 

z^ := ^(1 - 2/3)75,™ax|o, (^/3(2-x*(C)) - (1-2/3)75_gs)}x^^c:}_ (12) 

e£ls CsC: |SnC| even 


The first term in (1121) is the direct contribution of the edges of Is to z^. The second term is 
exactly what is still needed to obtain ([9]) for all C G C with jS* H C| even; see the proof of Lemma 





































[5] below. We will have to show that the total cost of the weighted sum of the second terms in (I12p 
is not more than '^sesPS ~ “ 75 ,e)c(e); see Section[221 

Lemma 5 If 0 < /3 < i and Y^sesPsci^^) < (1 “ 2/3) YsesPSc{Is) for the vectors (S eS) 
defined in (1121) . then BOMC(p) < (2 — f3)c{x*). 

Proof: Note that the vectors are nonnegative for all S G 5 (as jd < \ and 75^6 > 0 for all 
e G Is)- We have for every S ^ S and C G C with [S' n C*! even: 

z^{C) > (l-2/3)7^^^g + (/3(2-x*(C))-(l- 2/3)75, ,s) = /3(2-x*(C)), 

as required by ([9]). So the bound immmediately follows from (llOp . using YsesPS^i^^) — ~ 

2/3) YsesPsc{Is) and YsesPsi^iJs) + c{Is)) = c{x*). □ 

We will try to maximize /3. 


2.2 Bounding the cost 

The cost of the vectors z^ {S G S) will of course depend on the choice of the 'js,e (5* G 5, e G Is)- 
The desired inequality YsesPs^i^^) ^ (1 “ 2/3) YsesPso{Is) is implied by 

P 5 max|o, /3(2-x*(C')) - ( 1 - 2 / 3 ) 75 , gs} < Y 1 ^5(1 “ 2/3)(l - 75 ,^ 3 ), 

5G5:|SnC|even Se5:|SnC|=l 

as may be seen from (I12h and will be formally shown in the proof of Lemma [7] below. To write 
this inequality in a more compact form (see (I13p below), we divide by 1 — 2/3 and use the following 
notation: 


Definition 6 Given numbers 75,6 > 0 for S £ S and e G Is: we define the benefit of{S, C) £ S xC 
to be bs,c ■= min | ’ "Is.eg} ^ C\ is even, bs,c := 1 - 7s,eg */ I-? H C] = 1, and 

bs,c = 0 otherwise- 


Now all we need to show is that we have enough benefit at every narrow cut: 


Lemma 7 Let 0 < /3 < ^ and 0 < 75,6 < 1 for S £ S and e £ Is- If 


Yp^^s,c > 

SeS 


fi{2-x*{C))pg,, 

1-2/3 


for all C £ C, then BOMC(p) < (2 — /3)c(x*). 


(13) 
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Proof: By (I13p and Definition [6] we have for every C G C: 


/3{2-x*{C))pg, 

1-2/3 

We compute: 

EsesPSciz^) 


< PS ’ ^^-4 1 ^ ^'s(l-7s,eg)- (14) 

Se5:|5nC|even 1 7 J 5g5:|5nC|=l 


1 - 2/3 


X] X] ^S,eC{e) + Y1 1 


SeS 


v^s/s 


CeC:|5nC|even 


/3{2-x*{C)) 
1 - 2(3 


7s,eg) } c(ec) 


^PsY^ 7s,eci^) 
ses eels 


+ E 

cec 


l3{2-x*iC))p: 


c 

even 


1-2/3 


PS ™in 

SeS:|SnC|even 


f /3(2-x*(C)) 
{ 1-2(3 ’ 


c(ec) 


P5(l -7s,eg)c(ec) 

ses eels ceC seS:|Snc|=i 

- Yp^Y Ts.ecCe) + Y Y P^^^ ~ 'Ts,eg)c(ec) 

ses eels ceC se5:|Snc|=i 

- Yp^Y 'Ts,ec(e) + Yp^Y^^~ ^s,e)c{e) 

ses eels ses eels 

ses 

(we used ([Hll in the first, 7s eg — 1 ^^*1 c{ec) < c(e^) in the second, and ([7]) in the third 
inequality). Now the assertion follows from Lemma [5l □ 

Let us quickly check the obvious (although we will not need it): if we simply choose 7 s,e = ^ 
for all S' G 5 and e & Is, then we have for all C G C: if p^^(2 — x* (C)) < then Ylses Psbs,C 
j-^(2 — x*(C))p^gjj, and otherwise 


Ypsbs,c = pYl+PLnl > ^((2-X*(C))+pfven) > 2{2 - X* {CM 


'even 


SeS 


(the hrst inequality follows from (l5|), and the second one follows from 0 < (2 — x*{C)) +Peven — 1 
(cf. dH)) and the fact that a + b > Aab for all a, b > 0 with a + b < 1). 

So we have (I13p for /3 = I and obtain again ISebd ’s 2013l | approximation ratio This is tight 
for x*{C) = I and p^g = p^en = when the total benefit J2sesPsbs,C is 
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2.3 More benefit 


Let us first give an informal description of our idea. The approximation guarantee can be readily 
improved if there are no cuts C with x*{C) ~ | (and p^ne ~ k Peven ~ the previous 

paragraph). We would like to obtain more benefit (i.e., more than i) for “critical” cuts C (i.e., 
those with x*{C) ~ |), and to achieve this we will reduce the beneht for less critical cuts (but 
still have enough). If C is a critical cut, S' E 5, and does not belong to any other critical cut, 
we can choose 75^6 > 5 if |SnC| is even and ^s,e < 5 if|SnC| = 1 . We will describe how exactly 
at the beginning of Section HI In this case we get bg^c > \- 

However, this does not work if, for a critical cut C and every S ^ S, the edge also belongs 
to another critical cut C' (and |S H Cl + |S n C'\ = 3). In this case we can only choose 'ys,e = \ 
and get But note that then S has at least one edge in C n C. However, due to Lemma 

[3l there cannot be too many edges in the intersection of two critical cuts: certainly less than one 
per tree on average. 

We will fix a constant ^ betwe en | and 2, and consider '■= \ C E C : x*{C) < ^}. These 
cuts were called (^ — l)-narrow bv lAn~ Kleinberg and Shmovs 2015 |. but we prefer to call them 
^-narrow. Cuts that are not ^-narrow will not be critical. For any C E \ {h(s),h(t)}, let C^— 
and C_^. be the adjacent ^-narrow cuts in both directions. More precisely, if C = S{Lj), then 
:= 6{Li) and C_^. := 5{Lk), where i = max{r : i < j, x*{6{L^)) < and k = min{«: : k > 
j, x*{S{Lk,)) < (cf. Lemma [T]). 

Let C be a critical cut. By Lemma [3l x*{C C C<_) < 1 and x*{C H C_^) < 1, so there must 
be trees S £ S with pg > 0 with less than two edges in the disjoint union (C H C<_) U (C C C_j.). 
Therefore, if every tree S with pg > 0 has either larger benefit than ^ or (benefit exactly ^ 
and at least two edges in (C H C^-) U (C PI C_^)), we get '^g^gPsbg,c > which leads to an 
improvement. This will be essentially our argument. 

As can be seen from Figure [21 there are however —in addition to trees with more than two edges 
in C — four configurations (orange and purple) that do not have this property. Our reassembling 
step, to be described next, aims at avoiding two out of these four configurations (one of the two 
orange ones and one of the two purple ones). It will turn out that this is enough. 


3 Reassembling Trees 

In this section we show how to reassemble the trees that contribute to the convex combination 
X* = '^g^s PSX^ order to remove certain bad configurations. 

Given a constant | < ,^ < 2 (the exact value will be chosen later), we number the ^-narrow 
cuts = {Co,..., Ci'} from left to right; i.e., for 0 < i < k < f and Ci = S{L^) and Ck = ^(Lk) 
we have t < k. Note that Cq = 5{s) and Cg/ = S{t). Moreover, for C = Cj {i E {1, ... ,i' — 1}) we 
have C.(_ = Cj_i and = Cj+i. 
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Figure 2: Configurations at a ^-narrow cut C; here and are the adjacent ^-narrow cuts. 
The types, according to Definition |8l are shown on the right. In each configuration we see the (one 
or two) edges of S'C C for some S G 5; if there are two, the edge that belongs to the s-f-path is 
shown on top. If C is critical (i.e., x*{C) ~ |), each of these configurations will get benefit at least 
Green configurations will get benefit strictly more than which is possible because Cq belongs 
to no other critical cut. Blue configurations have two edges in {C H C^-) U (C C C-^), which is 
more than possible on average. Orange and purple configurations are problematic. Configurations 
with even more edges in (C C C^) 0(0 0 C^) or with three or more edges in C (these will be 
called good) are not shown. 

3.1 Types of trees at ^-narrow cuts 

We now define the type of a spanning tree at a .^-narrow cut. We remark that the type can depend 
on the value of This is no problem since ^ is a fixed constant (chosen later). 

Definition 8 Fix a constant | < ^ < 2. Then, for any tree 5* G 5 and any cut C G C^\{6{s), 6{t)} 
we define the type of S atC as follows. Let I = |S'nC'nC'<_| and m = IS'nC'l and r = |S'nC'nC'_^|. 
If m > 3 or I + r > 3 or {I + r > 1 and S CiC {e} for all e £ S DC and all C G C^), then the 
type is “good” otherwise the type is “Imr”. 

Note that if the type of S' at C is GOOD, then |S C Cl >2. See Figure [2] for a list of all types 
that are not GOOD, and Figure [3] for the types of the tree in Figured] (assuming that all narrow 
cuts are in C^). All types are also listed in Tabled) on page dHi 
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Figure 3: The spanning tree {V,S) from Figured) Assuming that all narrow cuts (grey vertical 
lines) are ^-narrow, we list the type of S at each C £ \ {(5(s), (5(t)} according to Definition [H) 

For the cut C marked GOOD we have / = 0, m = 2, r = 1, and S' n C" 7 ^ {e} for all e G S H C and 
all C' G 

3.2 Reassembling lemma 

Here is our key lemma for reassembling trees. 

Lemma 9 Let i G {1 ,... — 1}, so Ci £ \ {(5(s), Let Si,S 2 G S sueh that Si has type 

120 at Ci and S 2 has type Oil at Ci. 

Then there are two edges ei G Si and 62 £ S 2 such that S( := (Si \ {ei}) U { 62 } G S and 
S 2 := (S 2 \ { 62 }) U {ei} G S. Moreover we have: 

(a) At each of Ci,..., Ci-i, the new trees have the same type as the old trees. 

(b) At Ci, S^ has type 121, and S 2 has type 010. 

(c) If S^ has type 110 or 021 at Cj for some j > i, then Si has the same type at Cj. 

(d) If S 2 has type 110 or 021 at Cj for some j > i, then S 2 has the same type at Cj or S^ has 
type GOOD at each of Ci^i,... ,Cj. 

See Figure ID for an example. 

Proof: Let 62 be the only edge in S 2 H Cj. As S 2 has type Oil at Cj, there is an index k £ 
{i + 1, ... — 1 } such that 62 belongs to the cuts Cj, ..., Cfc but neither Cj_i nor Ck+i- 

Let Si n Cj = {eo, ei}, where cq belongs to the s-t-path in Si (so Ig^ n Cj = {eo}). As Si has 
type 120 (and not GOOD) at Cj, there must be an edge in Si n Cj that is the only edge of Si in 
some other ^-narrow cut, and such an edge must belong to the s-t-path in Si. So there exists an 
index /i G {0,..., f — 1} such that C/* n Si = {eo}; moreover ei ^ Cj-i U Cj+i. 

The graph {V, Si U { 62 }) contains a circuit A. Any circuit has even intersection with any cut. 
As 62 ^ C/i, the circuit A does not contain eo- However, A must contain (at least) a second edge 
in Cj besides 62 ; so A contains ei. 
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Figure 4: Reassembling trees: exchanging two edges in trees of types 120 and Oil (Lemma [9|). 


We have S'l G S because ei belongs to the circuit A in {V, Si U { 62 })- We have 5^ G 5 because 
52 n Cj = { 62 } and ei G C,. 

Property (a) is obvious as nothing changes for Cq, ... ,Ci-i: note that 61,62 G C* \ Ci-i. 
Property (b) follows from S{riCi = {cq, 62 } and n Cj = { 61 }. 

To show (c) and (d), we first prove: 

Claim: has type GOOD at Cj+i ,... ,Ck- 

To this end, observe that n Cj| > |(5i \ { 61 }) n Cl, | + 1 > 2 for j = i,..., fc. 

Let V 2 be the “right” endpoint of 62 , and vq the “right” endpoint of cq (i-e., if C, = d{Li_), 

then Vo,V 2 ^ cf. Figure 01) Let P be the uo-U 2 -path in S[. We observe that (*) P crosses 
every cut among Cj+i ,... ,Ck an odd number of times and every cut Cj with j < i or j > k an 
even number of times. Therefore P contains neither cq (it contains an even number of edges from 
S[r\Ch = { 60 }) nor 62 (it contains an even number of edges from S'lClCi = {cq, 62 }). 

Now let j G {i + 1,..., k} and e G Is[ H Cj. We have 6 7 ^ 60 because cq ^ Cj. Moreover, 

6 7 ^ 62 because 62 ^ Is[ (as this s-t-path contains an odd number of edges from S[riCh = {eo} 
and from S[r\Ci = {eo, 62 })- So e ^ C,. 

Suppose S[riCq = {e} for some q (otherwise S[ has type GOOD at Cj because 62 guarantees 
/ + r > 1); then q > k (as 62 G C, U • • • U C^), and from (*) we get e ^ E{P)- Then we have 
6,62 G 5{ n Cj and \E{P) n Cj\ is odd and 6,62 ^ E{P). This implies \S[ n Cj\ > 3, so again S[ 
has type GOOD at Cj. The claim is proved. 

The claim directly implies (c) and (d) for j G {i + 1,..., /c}. 

Now let J G {k + 1,... — 1}. Suppose 5i and S[ have different types at Cj. Since I, m, r (cf. 

Definition [ 8 ]) are identical at Cj, the only remaining reason for different types is that S'lnCl,' = {e} 
(and thus 5{ PI Cj/ = (e, 62 }) for some i < j' < k and 6 G 5i n C,- = 5( n Cj. But then the new 
type is GOOD. 

Now suppose S 2 and S 2 have different types at Cj for some j G {A: + 1,..., — 1}. Again 

I, m, r are identical at Cj, and the only remaining reason for different types is that 5^ H Cji = {e} 
(and thus S 2 H Cj/ = {e, 62 }) for some i < j' < k and 6 G 52 P Cj = 5^ P Cj. But then e ^ 
because contains an odd number of edges in Cj/ and must contain 62 . This implies that 52 PCj 
contains another edge in addition to 6 , and hence P Cj| = |52 P Cj| >2. Therefore the new 
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type can only be 120 or 121 or 220. 


□ 


3.3 Resulting types 

Let := E 5 e 5 :S has type rate 1^5 for every type r and C G \ {(5(s), 5(t)}. We apply the 
previous lemma from left to right, and at each cut as long as possible. In order to obtain a 
polynomial-time algorithm, we will round all ps down to integer multiples of for some small 
positive constant e. 

Corollary 10 For any constants | < ^ < 2 and e > 0 there is a polynomial-time algorithm which, 

given an instance {V,s,t,c) and an optimum solution x* of ([T]), a set C S and numbers ps 

2 _ — 

with ^ps G N for S G 5+ and YlseS+PS — computes another set 5+ C S and numbers ps 

with ^ps G N /or 5 G S+, such that Y^seS+PSX^ = T>seS+PSX^ and 

min{po^i,pf^o} = 0 and Piw < Puo + Pgood and P 021 < P 021 + Pooou (15) 
for allC eC^\{6{s),d{t)}. 


Proof: We first compute the .^-narrow cuts (cf. Proposition [2]) and process them from left to 
right, ignoring 5{s) and S{t). Initially, 5+ := 5+ and p := p. At each Ci {1 < i < P — 1) we 
check the types of all trees S G Whenever we have two such trees S'! and S 2 of type 120 and 
Oil, respectively, we set 5 := min{p 5 j,p 52 }) decrease ps^ and ps 2 by 6, and increase and P 5 / 
by 6, where S[ and 5^ are chosen as in Lemma [9l If 5^ but not S 2 has type 110 or 021 at Cj 
for some j > i, then S[ has type GOOD at each of Cj+i, ... ,Cj, and we maintain the properties 

mm{p^fl,p^ 2 o} = 0 for /i < i and p^fQ < p^f^ + Y^s&S'Ps and p^f^ < p^f-^ + Y^seS'Ps fo^ all j, 

where S' := {S G S, S has type GOOD at Cj and type 121 or GOOD at Cj, ..., Cj-i}. 

We remove a tree S from 5+ if ps drops to zero. Note that at any stage, all ps (and 5) are 

2 — 

integer multiples of so there are never more than ^ trees in S+. □ 

By symmetry, we also have: 


Corollary 11 For any constants | < ^ < 2 and e > 0 there is a polynomial-time algorithm which, 

given an instance {V,s,t,c) and an optimum solution x* of ([1]), a set 5+ C S and numbers ps 

2 _ — 

with ^ps G N for S G 5+ and ^seS+PS — 1> computes another set 5+ C S and numbers ps 

with ^ps G N /or 5 G S+, such that ^seS+PsX^ = T,seS+PsX^ and 

niin{pno>Po^i} = 0 and Pqu < Pan + Pgood and P 120 < P 120 + Pgooo (16) 
for allC GC^\{6 {s),6{t)}. □ 


We conclude: 
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Theorem 12 For any constants | < ^ < 2 and e > 0 there is a polynomial-time algorithm which, 
given an instance {V,s,t,c) and an optimum solution x* of ([1]), computes a set 5+ F S of trees 
such that there exists a distribution p on S with x* = 1*5 = 0 for all S ^ S \ 5+ 

and 

{ Pi 2 o + Po 2 i^ Pmi + Pno) Pmi + Po 2 i^ P 120 + Piio } — Pgoou + ^ (17) 

for allC eC^\{5{s),6{t)}. 

Proof: First compute any distribution p' with x* = YlsesPsl^^ and with less than trees S 
with p'g > 0, and the set 5^ of these trees, using the ellipsoid method (|Grotschel. Lovasz and Schriivei 


1981|) or the splitting-off technique fcf. lGenova and WilliamsonI 2015l ]l. Then set p'f := ^ L^PsJ 
for all S £ S] and let Sf_ be the set of trees with pg > 0. Note that YlseS^Ps ~ P's) ^ 

Then apply Gorollarv 1111 to 5" and p”. We get 5+ and p with min{p|^Q,p,^^} = 0 for all 
C £ C^\{6{s),5{t)}. Applying Corollary [TO] to 5+ and p yields 5+ and p with min{p^]^,p^Q} = 0 
and mi\i{pY^Q,p§ 2 ,i} < Pgood foi^all C £ \ {J(s), Then p := p p' — p" is a distribution 

as required. We output 5+ := 5+ U 5^. □ 

Note that our algorithm only needs the set which contains less than ^ trees. The 
above proof shows that we can also compute the distribution p in polynomial time, but it is needed 
only for the analysis. 


4 Improved approximation ratio 

We now show how to set the numbers ^s,e for S £ S and e £ Is so that the total benefit (according 
to Definition [ 6 |) is large. 

For some constant /3 slightly larger than | (to be determined later), let f{x) := 1 - 2,3 • 

By Lemma [7| and (|3|), an average benefit of f{x*{C)) from the trees will be sufficient for a narrow 
cut C. 

Ideally, we would like to have 'ys,e > f{x*{C)) e £ S CiC and \S H C\ even, and 1 — 'ys^e > 
f{x*{C)) if e E S' n C and |S n G] = 1. But this may be impossible because f{x*{C)) can be 
greater than ^ for /3 > |. Therefore we cut off at More precisely: 

For S £ S and e £ Is "we define two numbers 0 < /i, /2 < If |S n C| > 1 for all C E C with 
e £ C, then fi := 0, otherwise fi := min{T, max{/(a:*(G)) e £ C £ C ,\S r\ C\ = 1}}. If |S n C| 
is odd for all C E C with e £ C, then /2 := 0, otherwise /2 := min{^, max{/(x*(C')) : e £ C £ 
C, |S n C\ even}}. If /2 < /i, then we set 75^6 := f 2 i otherwise we set 'ys,e := 1 — /i- 

4.1 Less critical cuts 

Lemma 13 For all C £ C with f{x*{C)) < ^ we have (1131) . 
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Proof: By the above choice of 75 e® ^ ‘5)) have > f{x*{C)) if \S H C\ is even 

and 1 — 75eg — /(®*(^)) H Cl = 1. Let q := Yls&S'PS where 5' := {5 G 5 : jS" n 
C| even, 75eg}- (^7 Definition [6]) 

'^psbs,c > q • ~ 

s&s ^ 


As pggg > 2 - x*(C) > l^lcy-i ^?ven > (p?ven “ ?) by dS]) and ©, we get 


2-x*(C) in 


'^Psbs,c > q- 

SeS 


/3(2-x*(C)) 

1-2/3 


+ 


f{x*{C)) (^1 + - q) 


/3{2-x*{C))pg, 

1-2/3 


□ 


So we need to analyze only the remaining cuts (with f[x*{C)) > ^ and hence x*{C) ~ 1.5, 
more precisely with | — — ^*(^) — I + — ^)- 


4.2 Most critical cuts 

Now we consider the cuts C £ C with f{x*{C)) > We first establish what we outlined in 
Subsection 12.31 


Lemma 14 Let 0.4 < /3 < 0.5 and 1.7 < ^ < 1.8. Let C £ \ {5(s), <5(t)} with f{x*{C)) > 

and S £ S with |5 n C| = 1 or |S'n C| even. Then: 

• bs,c > i 

• Moreover, if ^ C<_ U C_^ or S' n C' 7 ^ {^c} ^ > then hs,c > 1 — /(O- 


Proof: First let |S n C| = 1. Then bs,c = 1 ~ 75e® • b"or S and e = we have /i = ^ > / 2 , so 
75 ,eg = h<\- Moreover, if eg ^ C^ U C_^, then /2 < /(O- 

Now let |Sn C| be even. Then hs,c '■= |^^T^|^)75eg|- 

We first observe that x*{C) < 1.7 and f{ff) > /(1.8) > 0.32 and hence ^ 


^>f >1-/(0- 

Now consider the second term. We have f 2 = \ > /i, so 7 ^gS = 1 — /i > Moreover, 


if 


S n C' 7 ^ lor all C £ (this holds in particular if |S n C| 7 ^ 1 and eg ^ U C_^), then 


h < /(O- 


□ 


Now we can bound the benefit for most critical cuts, of course using a distribution according 
to Theorem [T 2 I 
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Lemma 15 Let 0.4 < P < 0.5 and 1.7 < ^ < 1.8 such that v := 1 — /(^) > Let e > 0. Let p be 
a distribution on S with x* = '^seS (I17p . Let C G \ {J(s), J(i)} with f{x*{C)) > ^ 
and x*{C) > 2 — |. Then 


‘^^Psbs,c > 1 + ^5 - ^(x*(C')+ 0 - e 

S G<S 



(4zy- l)p^any 


(18) 


Proof: As before, let and be the adjacent ^-narrow cuts left and right. 

Let Is = |5nCnC^-l, ms = ISnCl, and = |S'nCn C^\. Note that Yhs&sP^^s = x*{C) 
and, using Lemma [3l J2s&sPs^s = x*{C^- n C) < ^{x*{C<-) + x*{C)) — 1 < + x*{C)) — 1. 

Analogously, ^s&sPS'^S < ^(^ + - 1- 

For all S' G 5 with ms > 3, we have 

265,0 - (m5 + l)(z^ - i) + (4i/- 1)[^2^J > 1- (19) 

(If ms is odd, then bs,c = 0 and thus 26s^c' — (m 5 + l)(i/ — ^) + (4i/ — 1) ~ —3z/+l > 1. 

If ms is even, then bs^c > ^ and thus 265 ^ 0 ' —(m 5 + l)(z^ — |) + (4i^ — l) J > msn — 5u+^ ^ §■) 

Now we distinguish four cases (cf. (|17p l. 


Case 1: ^^20 +15^21 < Pgood + e- 

For S G S, let 05 = 1 if S has type 120 or 021 at C, as = —1 if S has type GOOD at C, and 
05 = 0 otherwise. Note that Yls&sPS^S < 

Then for all trees S G S we have (cf. Lemma [14] and Tabled]): 

• "15 > 3, or 

• bsfi > ^ and Is + rs - ms + 05 > 0 , or 

• bs,c > " and Is + rs - ms + as > - 2 , 
and hence (using (I19p for the case ms > 3) 

265,0 + (^5 T^-S - m-s+ a 5 )(" - ^) + ( 4 " - 1)L^^^^J > 1- 

Taking the weighted sum, this implies (using (| 6 |)) 

2 Psbs,c > 1 + (2 - C - e) (z^ - 4) - (4i/ - l)pSany 

As x*{C) > 2 — I implies 2 — ^ > 5 — |(x*(C) + ^), we obtain (fTHP . 

Case 2: pg^ + pg^ 

— Pgood 4” 

For S G S, let 05 = 1 if S has type 011 or 110 at C, as = —1 if S has type GOOD at C, and 
05 = 0 otherwise. Note that J2s&S PS^S < 

Then for all trees S' G 5 we have (cf. Lemma dH and Tabled]): 
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bs,c 

1 

m 

r 

Case 1 

Case 2 

Case 3 

Case 4 


benefit 

number of edges in 

1 + r — m 

1 + r + m 

21 + r 

l + 2r 

type 

> 


c 
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> 

010 
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0 
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2 
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2 

2 

2 

0 

0 

4 

4 

2 
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1 

2 

1 

2 

1 

0 

4 

3 

3 

GOOD 

V 

1 

2 

>1-1 

-1 

2 

1 

1 

GOOD 

1 

2 

1 

2 

>3-1 

1 

4 

3 

3 

GOOD 

0 

> 0 

m > 3 

> 0 

—m 

m 

0 

0 


Table 1: The different types and their contributions. The bounds on the benefit (second column) 
follow from Lemma [TTl here we use f{x*{C)) > After reassembling, except for an e fraction of 
the trees, the two orange types (Oil and 120) cannot occur simultaneously, and if the two purple 
types (110 and 021) occur simultaneously, this is compensated by GOOD types. 


• fns > 3, or 

• bs,c > I and Is + rs + ms + as >3, or 

• bs,c > and Is + rs + ms + 05 > 1 , 

and hence (using again ([T9]) for the case ms > 3) 

2 bs,c + {Is + rs+ ms + as - 3){i^ - ^) + > 1. 

Taking the weighted sum, this implies 

2 Psbs,C > l + i5-2x*{C)-C- e) {l^ - - (4z^ - l)Pmany> 

SeS 

implying (fTS]) because ^ > x*{C). 
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1 if S' has type GOOD at C, and 


Case 3: + p^ 2 i < Pgood + e. 

For S G S, let 05 = 1 if S has type Oil or 021 at C, as = — 
as = 0 otherwise. Note that Yls&sPS^S < 

Then for all trees S G S we have (cf. Lemma [TT] and Tabled]): 

• rns > 3, or 

• bs,c > \ and 2ls + rs + as > 2, or 

• bs,c > and 2ls + rs + as > 0, 

and hence (using once more (|19p for the case ms > 3) 

2bs,c + {2ls + rs + as-2){iy-^) + {4u-l)[^\ > 1 . 

Taking the weighted sum, this implies 

2 ^ Psbs,C > 1 + (5 - I (X* (C) + 0 - e) - 5) “ ( 4 ^^ “ l)Pmany, 

ses 

i.e., (dH). 

Case 4: + p^Q < p^ood + ^• This is symmetric to Case 3. 

So ()18p is proved in all cases. □ 

4.3 Setting the constants 

We now obtain our main result easily: 

Theorem 16 If ^ = 1.73 and e = 0.01 and p is a distribution as obtained in Theorem I ill then 
BOMC(p) < (2 —/3)c(x*) for j3 = 0.401. In particular, we have an 1.599-approximation algorithm 
for the s-t-path TSP, and the integrality ratio of 0 is at most 1.599. 

Proof: We have = 1 — /(^) > 0.6. We use Lemma [7| and need to show (I13p . 

Let C G C be a narrow cut. If f{x*{C)) < we have shown (I13p in Lemma [T3l So let now 
C G C be a narrow cut with f{x*{C)) > Note that 1.44 < x*{C) < 1.56 and thus in particular 
C G \ {6{s),6{t)} and x*(C) > 2 — We apply Lemma fTHl to C. 

The constants e, and /3 are chosen so that 

l+(5-|(i--(C)+{)-e()(^-0 > (20) 

holds for all values of x*(C). 
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Moreover, 13 > 6 + 4 ^ 2 =?)’ 4/3(2-0 > 3-6/3-4/3(2-0(^-1) and hence j^{2-x*{C)) > 

Therefore, using ([6]) and (jj]), 


_ c'l > 3-6/3-4/3(2-g)(g-l) _ 3-4/(g) _ 4^-1 
1-2/9 ^ 2(1-2/3) ~ 2 ~ 2 


2/3(2 x*{C))pg^_^ ^ 


1-2/3 


= P, 


many 

c (2P{2-x*{C)) 4u-l 


- 


1 - 2/3 


Ah' — 1 


< ix*iC)-l) 
2/3 


2/3(2-xyC)) 


1-2/3 2 

(xyc)-i)(2-xyc-)). 


1 - 2/3 

Together with (fTH]) and (f20|) . this directly implies (fT^ . 


□ 


5 Enhancements 

The constants ^ and /3 in the previous section are not optimal, but they are close. The bounds 
are almost tight for Case 3 (and 4) and x*{C) ~ 1.52 and p^any = 0- However, we now suggest 
two ideas for a refined analysis that l ea ds to a further improvement. 

Firstly, since (in contrast to Sebd ’s |2013l | analysis) the worst case does not occur in x*{C) = 
1.5, but in a slightly larger value, one can increase /3 and hence improve the approximation ratio 
by increasing the y-values slightly. 

Secondly, the analysis in Case 3 (and 4) of Lemma [15] can be refined (and the analysis was not 
tight in Case 1 and 2 anyway), as we will indicate now. Consider a critical cut C in Case 3, and 
assume for simplicity p^any = 0- Het | < 2 with f{g) = and let Cg be the next ^'-narrow 

cut to the left of C. We have benefit i for at most a ia:(C'nC'i^)+x(C'nC'|() < |®(C') + 4^+^^' —| 
fraction of the trees, and larger benefit for the others. More generally, we have benefit at most 
1 — f{y) for at most a 4x(C' C Cg) + x{C C Cg) < jx{C) + + I?/ — f fraction of the trees, for 

all y G [^^^]• See Figure 5. 

We have not performed the necessary calculations to obtain the best possible approximation 
ratio with these ideas. However, it seems that the resulting improvements are rather small. For a 
much better bound, we would probably need stronger reassembling results. 


6 Discussion 


Theorem [T6l readily leads also to an i mproved approximation ratio for th e pri ze-collecting g-t-path 
TSP, simply by applying Theorem 6 of An. Kleinberg and Shmovs 20151 ] . See Guttmann-Beck et al 
' 200d | for further applications. 
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Figure 5: Refined analysis of Case 3. 


Theorem [T3 improves the best known upper bound on the integrality ratio of the LP ([T]) . The 
best known lower bound is |, shown by the metric closure of an unweighted circui t. For metric clo¬ 
sures of unweighted graphs the integrality ratio is indeed | (even for T-tours), as ISebd and Vygen 


20141 ] proved. For general metrics, this remains open. 


The most natural open question is of course to improve the approximation ratio further. Our 
improvement was only small, but the reassembling technique could be more powerful than we 
were able to prove. It seems that a stronger version of Lemma [9] would be need ed. It w ould a lso 


be interesting to generalize our algorithm to the T-tou r problem for general T (ISebdI’s 1201311 


approximation algorithm and the previous algorithm of ICherivan. Friggstad and Gad 20151 ] work 
also for this more general problem). Finally, applying our technique to other TSP variants would 
be very interesting. 
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Appendix: an Example 


The following example shows that we cannot get more benefit than ISebd |2013l | , no matter how 
we choose the numbers 7s,ej if we represent the LP solution x* in an arbitrary way as convex 
combination of spanning trees. 

We show one instance with 20 vertices and 30 edges. From this we obtain an inhnite sequence 
of instances by extending the wall-like part in the middle, inserting copies of the blue part, adding 
4 vertices and 6 edges in each step. 

The number next to an edge e is x*{e)] then x* is a feasible solution of ([T]). Grey vertical lines 
show the narrow cuts. We have x*{C) = | for all C E C \ {(5(s), 6{t)}. 



Consider the red sets in the following figure and all singletons; call this set of sets U. Then 
the 30 vectors 5(JJ), U & U, are easily shown to be linearly independent. Since x* satisfies all 
constraints of ([I]) that correspond to these sets with equality, x* is indeed a vertex of the polytope 
dehned by Hence it is an optimum solution for some objective function. 



Now we will show a (bad) decomposition of x* into incidence vectors of trees. We have 
X* = ' where ps = \ for the following four trees and ps = 0 for all other spanning 

trees. In the wall-like part (grey cuts 6-9 from left in the top figure), the four trees have types Oil, 
110, 021, and 120. For each of these four trees S, each edge e of its s-t-path Is within the wall-like 
part belongs to one narrow cut C with IC n S'! = 1 and one narrow cut C with \C' n 5| = 2. No 
matter how we choose 7s,e, the total benefit of this edge to both cuts is 1, so the average benefit 
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that a narrow cut in the wall-like part receives is at most This shows that reassembling trees 
is necessary. 

We were unable to prove that the best-of-many Christofides algorithm has no better approx¬ 
imation ratio than 1.6, but to obtain a better ratio a completely different analysis would be 
necessary. In fact, no better lower bound than 1.5 is known. 
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